CDS

Accession Number TCMCG078C01850
gbkey CDS
Protein Id KAG0449538.1
Location complement(join(6837..6950,7026..7142,7277..7655,7995..8150,8385..8521,8606..8725,9521..9649,9787..9980,10064..10208))
Organism Vanilla planifolia
locus_tag HPP92_027319
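
The Location field above lists nine exon intervals on the complementary strand. As a quick consistency check, the sketch below sums the interval lengths and compares the result with the protein length given in the record (496 aa plus one stop codon). The helper names `exon_intervals` and `cds_length` are hypothetical, and the regular expression assumes a simple location string with no partial markers (`<`, `>`) or remote references.

```python
import re

# Location string copied verbatim from the CDS record.
LOCATION = (
    "complement(join(6837..6950,7026..7142,7277..7655,7995..8150,"
    "8385..8521,8606..8725,9521..9649,9787..9980,10064..10208))"
)

def exon_intervals(location):
    """Return (start, end) pairs, 1-based and inclusive, in listed order."""
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]

def cds_length(location):
    """Total number of nucleotides covered by all intervals."""
    return sum(end - start + 1 for start, end in exon_intervals(location))

n_exons = len(exon_intervals(LOCATION))
total_nt = cds_length(LOCATION)
residues = total_nt // 3 - 1  # subtract the stop codon

print(n_exons, total_nt, residues)  # prints: 9 1491 496
```

The 1491 nt total is divisible by three and yields 496 residues after the stop codon is removed, which agrees with the Length field in the Protein section.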

Protein

Length 496 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000181.1
Definition hypothetical protein HPP92_027319 [Vanilla planifolia]
Locus_tag HPP92_027319

EGGNOG-MAPPER Annotation

COG_category C
Description Belongs to the aldehyde dehydrogenase family
KEGG_TC -
KEGG_Module M00308, M00633
KEGG_Reaction R01058
KEGG_rclass RC00242
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K00131
EC 1.2.1.9
KEGG_Pathway ko00010, ko00030, ko01100, ko01120, ko01200, map00010, map00030, map01100, map01120, map01200
GOs GO:0005575, GO:0005622, GO:0005623, GO:0005737, GO:0044424, GO:0044464

Sequence

CDS:  
ATGGCGGGGACGGGAGTATTCTGCGAGATCATCGATGGGGAGGTCTACAAGTACTACAGCGAGGGGGAGTGGAGGAAGTCGAGCTCCGGGAAGTCGGTTTCCATCGTCAATCCTACCACGAGGAAGACGGAGTATCGAGTTCAAGCGTGCACGCAGGAGGAAGTGAATGAAGTGATGGAGGCGGCGAAGGTGGCGCAGAAGGCGTGGGCGAGGACGCCGTTGTGGAAGAGGGCTGAATTGCTTCATCGGGCGGCAGCTATACTCAAGGAGCACAAGGGGCCGATCGCAGAGTGCCTTGTGAAGGAGATCGCTAAGCCGGCCAAGGATGCCATTTCCGAGGTGGTGCGCTCGGGGGATCTCGTGTCTTACACTGCAGAGGAGGGAGTTAGGATTCTGGGCGAAGGAAAATTTCTGGTATCAGACAGCTTTCCGGGGAACGACAGAAACAAGTACTGCCTTGCTTCCAAGATTCCTCTTGGAGTTGTATTAGCAATTCCACCCTTCAACTATCCTGTGAACCTTGCTGTCTCTAAAATTGGTCCTGCTCTAATAGCTGGCAATGCTCTTGTGCTCAAGCCACCAACTCAGGGTGCGGTTGCAGCACTGCACATGGTCCATTGCTTTCACCTTGCTGGATTTCCTAAAGGCCTGATAAATTGCATTACTGGAAAAGGATCAGAGATAGGTGATTTCCTGACAATGCACCCTGGAGTCAATTGCATAAGCTTCACTGGTGGAGACACTGGAATTGCCATTTCCAAGAAGGCTGGTATGATTCCTCTTCAAATGGAACTTGGTGGTAAAGATGCATGCATTGTTCTTGAAGATGCTGATTTAGATTTGGCTGCTGCAAATATTGTGAAAGGAGGTTTTTCATACAGTGGTCAGAGATGCACTGCTGTAAAAGTAGTTCTTGTGATAAAATCTGTGGCGGATGCTCTCGTGGAGAAAGTAAAAGATAAGGTGGAAAAGCTAACAGTTGGCCCACCGGAGAAGGACTGCGACATCACCCCCGTGGTGACGGAGTCCTCGGCTAATTTCATCGAAGGTTTGGTTATTGATGCGAAGGAAAAAGGTGCAACCTTTTGCCAGGAGTACAGAAGGGAAGGTAATCTCATATGGCCCTTGCTTCTGGATCATGTGAGGCCTGACATGAGGATTGCTTGGGAAGAACCCTTTGGGCCTGTGTTGCCAGTTCTTAGGATTAATTCGGTCGAGGAGGGCATTCACCATTGCAATGCCAGCAATTTTGGTCTTCAGGGCTCCGTCTTCACTCGAGACATCAACAAGGCCATTTTGATTAGTGATGCCATGGAGACGGGTACGGTTCAGATCAACTCTGCACCCGCCCGTGGACCGGACCATTTCCCTTTCCAGGGTCTAAAGGACAGTGGTATTGGATCCCAGGGGATCACAAACAGCATCAACATGATGACCAAGATCAAGAGCACGGTCATCAACCTCCCAACTTCATCGTACACTATGGGCTGA
Protein:  
MAGTGVFCEIIDGEVYKYYSEGEWRKSSSGKSVSIVNPTTRKTEYRVQACTQEEVNEVMEAAKVAQKAWARTPLWKRAELLHRAAAILKEHKGPIAECLVKEIAKPAKDAISEVVRSGDLVSYTAEEGVRILGEGKFLVSDSFPGNDRNKYCLASKIPLGVVLAIPPFNYPVNLAVSKIGPALIAGNALVLKPPTQGAVAALHMVHCFHLAGFPKGLINCITGKGSEIGDFLTMHPGVNCISFTGGDTGIAISKKAGMIPLQMELGGKDACIVLEDADLDLAAANIVKGGFSYSGQRCTAVKVVLVIKSVADALVEKVKDKVEKLTVGPPEKDCDITPVVTESSANFIEGLVIDAKEKGATFCQEYRREGNLIWPLLLDHVRPDMRIAWEEPFGPVLPVLRINSVEEGIHHCNASNFGLQGSVFTRDINKAILISDAMETGTVQINSAPARGPDHFPFQGLKDSGIGSQGITNSINMMTKIKSTVINLPTSSYTMG
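
The protein sequence above can be reproduced from the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The function name `translate` is hypothetical. To keep the example short, only the first 27 and the last 15 nucleotides of the CDS are translated.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the record does not name the translation table it used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCGGGGACGGGAGTATTCTGCGAG"  # first 27 nt of the CDS
cds_end = "TACACTATGGGCTGA"                # last 15 nt of the CDS

print(translate(cds_start))  # prints: MAGTGVFCE
print(translate(cds_end))    # prints: YTMG
```

Both fragments match the corresponding ends of the listed protein sequence, and the final TGA codon accounts for the extra triplet beyond the 496 residues.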